 Palembang


Searching the Title of Practical Work of the Informatics Engineering Bachelor Program with the Case Base Reasoning Method

Jaya, Agung Sukrisna, Arsalan, Osvari, Saputra, Danny Matthew

arXiv.org Artificial Intelligence

The advancement of technology and information has led to rapid growth in various fields. The global community relies extensively on technology to address the many challenges of the contemporary world. One prominent application is search systems, which offer efficient methods for locating specific information within vast data collections. For instance, a search system can be employed to locate titles of student practical work [1]. A search engine is the practical application of information retrieval techniques for large data collections; the term "search engine" was originally used to refer to specialized hardware for text searching [2]. Among the problem-solving techniques rooted in historical knowledge, Case-Based Reasoning stands out.


Vehicle detection from GSV imagery: Predicting travel behaviour for cycling and motorcycling using Computer Vision

Kokka, Kyriaki, Goel, Rahul, Abbas, Ali, Nice, Kerry A., Martial, Luca, Labib, SM, Ke, Rihuan, Schönlieb, Carola Bibiane, Woodcock, James

arXiv.org Artificial Intelligence

Transportation influences health by shaping exposure to physical activity, air pollution and injury risk. Comparative data on cycling and motorcycling behaviours is scarce, particularly at a global scale. Street view imagery, such as Google Street View (GSV), combined with computer vision, is a valuable resource for efficiently capturing travel behaviour data. This study demonstrates a novel approach using deep learning on street view images to estimate cycling and motorcycling levels across diverse cities worldwide. We utilized data from 185 global cities. The data on mode shares of cycling and motorcycling were estimated using travel surveys or censuses. We used GSV images to detect cycles and motorcycles in sampled locations, using 8000 images per city. The YOLOv4 model, fine-tuned using images from six cities, achieved a mean average precision of 89% for detecting cycles and motorcycles. A global prediction model was developed using beta regression with city-level mode shares as outcome, with log-transformed explanatory variables of counts of GSV-detected images with cycles and motorcycles, while controlling for population density. We found strong correlations between GSV motorcycle counts and motorcycle mode share (0.78) and moderate correlations between GSV cycle counts and cycling mode share (0.51). Beta regression models predicted mode shares with $R^2$ values of 0.614 for cycling and 0.612 for motorcycling, achieving median absolute errors (MDAE) of 1.3% and 1.4%, respectively. Scatterplots demonstrated consistent prediction accuracy, though cities like Utrecht and Cali were outliers. The model was applied to 60 cities globally for which we didn't have recent mode share data. We provided estimates for some cities in the Middle East, Latin America and East Asia. With computer vision, GSV images capture travel modes and activity, providing insights alongside traditional data sources.
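
As a rough illustration of the two quantities named in this abstract, the sketch below computes a median absolute error and evaluates a beta-regression-style mean prediction from log-transformed inputs. The logit link, the use of `log1p`, and every coefficient value are assumptions made for the example; the abstract does not state them, and this is not the authors' fitted model.

```python
import math
from statistics import median


def median_absolute_error(observed, predicted):
    """Median of |observed - predicted| over paired values (MDAE)."""
    return median(abs(o - p) for o, p in zip(observed, predicted))


def predict_mode_share(gsv_count, pop_density, coef):
    """Mean mode share in (0, 1) from log-transformed predictors.

    Assumption: a logit link, which is a common choice for beta
    regression. `coef` is a hypothetical (intercept, b_count,
    b_density) tuple, not a set of values from the paper.
    """
    b0, b_count, b_density = coef
    # log1p is used here so that a zero count stays finite (an assumption).
    eta = b0 + b_count * math.log1p(gsv_count) + b_density * math.log1p(pop_density)
    return 1.0 / (1.0 + math.exp(-eta))


if __name__ == "__main__":
    observed = [0.10, 0.20, 0.05]   # toy mode shares, not real data
    predicted = [0.12, 0.17, 0.05]
    print(median_absolute_error(observed, predicted))
    print(predict_mode_share(150, 4000, (-4.0, 0.5, -0.1)))
```

Reporting the median rather than the mean of the absolute errors keeps a few outlying cities from dominating the summary.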


Sentiment Analysis of Lithuanian Online Reviews Using Large Language Models

Vileikytė, Brigita, Lukoševičius, Mantas, Stankevičius, Lukas

arXiv.org Artificial Intelligence

Sentiment analysis is a widely researched area within Natural Language Processing (NLP), attracting significant interest due to the advent of automated solutions. Despite this, the task remains challenging because of the inherent complexity of languages and the subjective nature of sentiments. It is even more challenging for less-studied and less-resourced languages such as Lithuanian. Our review of existing Lithuanian NLP research reveals that traditional machine learning methods and classification algorithms have limited effectiveness for the task. In this work, we address sentiment analysis of Lithuanian five-star-based online reviews from multiple domains that we collect and clean. We apply transformer models to this task for the first time, exploring the capabilities of pre-trained multilingual Large Language Models (LLMs), specifically focusing on fine-tuning BERT and T5 models. Given the inherent difficulty of the task, the fine-tuned models perform quite well, especially when the sentiments themselves are less ambiguous: 80.74% and 89.61% testing recognition accuracy of the most popular one- and five-star reviews respectively. They significantly outperform current commercial state-of-the-art general-purpose LLM GPT-4. We openly share our fine-tuned LLMs online.


6GSoft: Software for Edge-to-Cloud Continuum

Akbar, Muhammad Azeem, Esposito, Matteo, Hyrynsalmi, Sami, Kumar, Karthikeyan Dinesh, Lenarduzzi, Valentina, Li, Xiaozhou, Mehraj, Ali, Mikkonen, Tommi, Moreschini, Sergio, Mäkitalo, Niko, Oivo, Markku, Paavonen, Anna-Sofia, Parveen, Risha, Smolander, Kari, Su, Ruoyu, Systä, Kari, Taibi, Davide, Yang, Nan, Zhang, Zheying, Zohaib, Muhammad

arXiv.org Artificial Intelligence

In the era of 6G, developing and managing software requires cutting-edge software engineering (SE) theories and practices tailored for such complexity across a vast number of connected edge devices. Our project aims to lead the development of sustainable methods and energy-efficient orchestration models specifically for edge environments, enhancing architectural support driven by AI for contemporary edge-to-cloud continuum computing. This initiative seeks to position Finland at the forefront of the 6G landscape, focusing on sophisticated edge orchestration and robust software architectures to optimize the performance and scalability of edge networks. Collaborating with leading Finnish universities and companies, the project emphasizes deep industry-academia collaboration and international expertise to address critical challenges in edge orchestration and software architecture, aiming to drive significant advancements in software productivity and market impact.


Parkinson's Disease Detection through Vocal Biomarkers and Advanced Machine Learning Algorithms

Sayed, Md Abu, Tayaba, Maliha, Islam, MD Tanvir, Pavel, Md Eyasin Ul Islam, Mia, Md Tuhin, Ayon, Eftekhar Hossain, Nob, Nur, Ghosh, Bishnu Padh

arXiv.org Artificial Intelligence

Parkinson's disease (PD) is a prevalent neurodegenerative disorder known for its impact on motor neurons, causing symptoms like tremors, stiffness, and gait difficulties. This study explores the potential of vocal feature alterations in PD patients as a means of early disease prediction. This research aims to predict the onset of Parkinson's disease. Utilizing a variety of advanced machine-learning algorithms, including XGBoost, LightGBM, Bagging, AdaBoost, and Support Vector Machine, among others, the study evaluates the predictive performance of these models using metrics such as accuracy, area under the curve (AUC), sensitivity, and specificity. The findings of this comprehensive analysis highlight LightGBM as the most effective model, achieving an impressive accuracy rate of 96% alongside a matching AUC of 96%. LightGBM exhibited a remarkable sensitivity of 100% and specificity of 94.43%, surpassing other machine learning algorithms in accuracy and AUC scores. Given the complexities of Parkinson's disease and its challenges in early diagnosis, this study underscores the significance of leveraging vocal biomarkers coupled with advanced machine-learning techniques for precise and timely PD detection.
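
For readers less familiar with the metrics reported above, the following minimal sketch shows how accuracy, sensitivity, and specificity follow from the counts of a binary confusion matrix. The function name and the toy labels are hypothetical and are not taken from the study's data or code.

```python
def binary_metrics(y_true, y_pred):
    """Accuracy, sensitivity and specificity for 0/1 labels (1 = PD)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    total = tp + tn + fp + fn
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        # Sensitivity: share of actual positives that were detected.
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        # Specificity: share of actual negatives correctly rejected.
        "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
    }


if __name__ == "__main__":
    # Toy labels for illustration only.
    y_true = [1, 1, 1, 0, 0, 0, 0, 0]
    y_pred = [1, 1, 1, 0, 0, 0, 0, 1]
    print(binary_metrics(y_true, y_pred))
```

A sensitivity of 100% with a specificity below 100%, as reported for LightGBM, corresponds to no missed positive cases but some false alarms among healthy subjects.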


Deep Learning for Diverse Data Types Steganalysis: A Review

Kheddar, Hamza, Hemis, Mustapha, Himeur, Yassine, Megías, David, Amira, Abbes

arXiv.org Artificial Intelligence

Steganography and steganalysis are two interrelated aspects of the field of information security. Steganography seeks to conceal communications, whereas steganalysis aims either to find them or, if possible, to recover the data they contain. Steganography and steganalysis have attracted a great deal of interest, particularly from law enforcement. Steganography is often used by cybercriminals and even terrorists to avoid being captured while in possession of incriminating evidence, even encrypted, since cryptography is prohibited or restricted in many countries. Therefore, knowledge of cutting-edge techniques to uncover concealed information is crucial in exposing illegal acts. Over the last few years, a number of strong and reliable steganography and steganalysis techniques have been introduced in the literature. This review paper provides a comprehensive overview of deep learning-based steganalysis techniques used to detect hidden information within digital media. The paper covers all types of cover in steganalysis, including image, audio, and video, and discusses the most commonly used deep learning techniques. In addition, the paper explores the use of more advanced deep learning techniques, such as deep transfer learning (DTL) and deep reinforcement learning (DRL), to enhance the performance of steganalysis systems. The paper provides a systematic review of recent research in the field, including data sets and evaluation metrics used in recent studies. It also presents a detailed analysis of DTL-based steganalysis approaches and their performance on different data sets. The review concludes with a discussion on the current state of deep learning-based steganalysis, challenges, and future research directions.


Detection of Surface and Underwater Trash in Video Objects Using the Robust and Efficient Post-Processing and Tubelet-Level Bounding Box Linking Methods

Tjandra, Bryan, Negara, Made S. N., Handoko, Nyoo S. C.

arXiv.org Artificial Intelligence

Indonesia, as a maritime country, has a significant portion of its territory covered by water. Ineffective waste management has resulted in a considerable amount of trash in Indonesian waters, leading to various issues. The development of an automated trash-collecting robot can be a solution to address this problem. The robot requires a system capable of detecting objects in motion, such as in videos. However, using naive object detection methods in videos has limitations, particularly when image focus is reduced and the target object is obstructed by other objects. This paper explains the methods that can be applied to perform video object detection in an automated trash-collecting robot. The study utilizes the YOLOv5 model and the Robust & Efficient Post Processing (REPP) method, along with tubelet-level bounding box linking on the FloW and Roboflow datasets. The combination of these methods enhances the performance of naive object detection from YOLOv5 by considering the detection results in adjacent frames. The results show that the post-processing stage and tubelet-level bounding box linking can improve the quality of detection, achieving approximately 3% better performance compared to YOLOv5 alone. The use of these methods has the potential to detect surface and underwater trash and can be applied to a real-time image-based trash-collecting robot. Implementing this system is expected to mitigate the damage caused by trash in the past and improve Indonesia's waste management system in the future.
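
The idea of using detections in adjacent frames can be illustrated with a simplified sketch that links boxes between two consecutive frames by intersection over union (IoU). This is a greedy, IoU-only stand-in written for illustration; it is not the REPP method or the paper's tubelet linking, and the function names and the 0.5 threshold are assumptions.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def link_frames(prev_boxes, next_boxes, threshold=0.5):
    """Greedily pair boxes across two frames, highest IoU first.

    Returns (prev_index, next_index) pairs; each box is used at most
    once, and pairs with IoU below the threshold stay unlinked.
    """
    scored = sorted(
        ((iou(p, n), i, j)
         for i, p in enumerate(prev_boxes)
         for j, n in enumerate(next_boxes)),
        reverse=True,
    )
    used_prev, used_next, links = set(), set(), []
    for score, i, j in scored:
        if score < threshold:
            break  # remaining candidates overlap even less
        if i in used_prev or j in used_next:
            continue
        used_prev.add(i)
        used_next.add(j)
        links.append((i, j))
    return links


if __name__ == "__main__":
    frame_t = [(0, 0, 10, 10), (20, 20, 30, 30)]    # toy boxes
    frame_t1 = [(21, 21, 31, 31), (1, 1, 11, 11)]
    print(link_frames(frame_t, frame_t1))
```

Chaining such links over many frames yields a track per object, which a post-processing step can then use to smooth or rescore the per-frame detections.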


Workload Forecasting of a Logistic Node Using Bayesian Neural Networks

Nakilcioglu, Emin, Rizvanolli, Anisa, Rendel, Olaf

arXiv.org Artificial Intelligence

Purpose: Traffic volume in empty container depots has been highly volatile due to external factors. Forecasting the expected container truck traffic along with having a dynamic module to foresee the future workload plays a critical role in improving the work efficiency. This paper studies the relevant literature and designs a forecasting model addressing the aforementioned issues. Methodology: The paper develops a forecasting model to predict hourly work and traffic volume of container trucks in an empty container depot using a Bayesian Neural Network based model. Furthermore, the paper experiments with datasets with different characteristics to assess the model's forecasting range for various data sources. Findings: The real data of an empty container depot is utilized to develop a forecasting model and to later verify the capabilities of the model. The findings show the performance validity of the model and provide the groundwork to build an effective traffic and workload planning system for the empty container depot in question. Originality: This paper proposes a Bayesian deep learning-based forecasting model for traffic and workload of an empty container depot using real-world data. This designed and implemented forecasting model offers a solution with which every actor in the container truck transportation benefits from the optimized workload.


The Week in Detail: AI, party presidents, and food banks

#artificialintelligence

Every weekday, The Detail makes sense of the big news stories. This week, we talked about the burgeoning concerns over artificial intelligence, talked to two former political party presidents about their hidden role, visited a food bank operating in the wealthy North Shore, looked at the fight to keep foot-and-mouth disease out of our farms, and finished the week with a new Supreme Court case trying to hold big corporations liable for contributing to climate change. Whakarongo mai to any episodes you might have missed. Artificial intelligence systems running rogue might seem like the stuff of science-fiction, but these systems are increasingly common in many high-tech elements of society, from self-driving cars to digital assistants, facial identification, Netflix recommendations, and much, much more. The capabilities of artificial intelligence are growing at pace; a pace that's outstripping regulatory frameworks.


Personalized Recommender System for Children's Book Recommendation with A Realtime Interactive Robot

Liu, Yun, Gao, Tianmeng, Song, Baolin, Huang, Chengwei

arXiv.org Artificial Intelligence

In this paper we study the personalized book recommender system in a child-robot interactive environment. Firstly, we propose a novel text search algorithm using an inverse filtering mechanism that improves the efficiency. Secondly, we propose a user interest prediction method based on the Bayesian network and a novel feedback mechanism. According to children's fuzzy language input, the proposed method gives the predicted interests. Thirdly, the domain specific synonym association is proposed based on word vectorization, in order to improve the understanding of user intention. Experimental results show that the proposed recommender system has an improved performance and it can operate on embedded consumer devices with limited computational resources.